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Abstract. By relating the number of images of a function with finite domain 
to a certain parameter, we obtain both an upper and lower bound for the image 
set. Even though the arguments are elementary, the bounds are, in some sense, 
best possible. The upper bound is also connected to triangular numbers, and a 
slight improvement to this bound could be obtained by resolving a problem on 
them. In the final section, we consider implications of our bounds in various 
settings, including finite fields, coding theory and additive combinatorics. In 
particular, we obtain the first non-trivial upper bound for the image set of a 
planar function over a finite field. 



Let A and B be sets, with A finite of order n, and let / : A — > B. We define the 
following notation, which will be used throughout this article. 

• The number of distinct images of / is denoted V(f). That is, V(f) = \f(A)\. 

• For r £ N, M r (f) is the number of y G B for which f(x) — y has r solutions. 

• Since A is finite, clearly M r (f) = for all sufficiently large r. We therefore 
define m to be the largest integer for which M m > 0. 

• For each integer r > 2, set N r (f) to be the number of r-tuples (xi, . . . , x r ) 
with Xi — Xj if and only if i = j which satisfy f{x\) — f{x%) = ■ ■ ■ = f{x r ). 

Several identities follow immediately from these definitions. 



In this paper we are interested in the relationship between V(f) and N s (f) for 
a fixed s. Intuitively, knowledge of N s (f) should imply some knowledge on V(f), 
and knowledge of N s (f) should yield more knowledge concerning V(f) than N s i(f) 
would for s' > s. Our main result is to obtain bounds for V(f) in terms of N a (f) 
which confirm this intuition. Moreover, when s = 2, our lower bound is tight for 
any value of N2(f), while our upper bound is tight in infinitely many cases. Our 
main theorem can be given in the following form. 

Theorem 1.1. Let f : A -> B with \A\ = n. Then 



For a more exact result when s = 2, see Theorem 14.11 below. Sections 2 and 3 
prove the upper and lower bounds for general values of s, while Section 4 treats the 
special case of s — 2. Section 5 contains several different applications. 



1. Introduction 



w.#i n/o=£r=iM-(/). 

Id.#2 n = J2Z 1 rM r (f). 
Id.#3 N.(f) = YZL.P(r,s) 



)M r {f). 
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2. A LOWER BOUND FOR V(f) WHEN N s (f) IS KNOWN 

For convenience, we set N s (f) = t. By the definitions above, 

s—1 tci 

^2rM r =n-t + ^2(P{r,s)-r)M r . (1) 

r— 1 r—s 

(We note that, since the snm on the right is at least m(m — 2), we must have 
Ylr=i r M r > Max(0, n — t 4- m(m — 2)).) We may manipulate fT]) as follows: 

s— 1 m 

V] rM r = n - 1 + ( p ( r > s ) _ r ) M r 

r—1 r—s 

m 

= n-t+(s\ - s)M s + 53 ( p ( r > s ) _ r ) M r 

r—s-\-l 

m 

>n-t+ (s!- s)M s + (s!- 1) 53 rM r 

r— S+l 

71 S — 1 

= n-t+(s\- s)M s + (si - 1) J] rM r - (s! - 1) 53 r^r - (s! - l)sM s 

r=l r=l 
s-1 

= a!n - 1 + s! (1 - s)M a - (s! - 1) 53 rM r- 

r=l 

Rearranging, we find 

s-1 

s\n-t< s! 53 rM r + s\ (s - 1)M S 

r=l 

s-1 

< s! (s - 1) 53 M r + s! (s - f )M S 



= s! (s - 1) 53 M ^ 

r=l 

< S !( S -i)n/), 

which establishes our lower bound. (We mention, in passing, that this proof is a 
generalisation of the lower bound obtained by Matthews and the first author [5] ; it 
was that note that formed the motivation for this article.) 



3. An upper bound for V(f) when iV s (/) is known 
First, we note that M m > 0, so we have that P(m, s) < t, which (roughly) yields 

m<t> +0(tsTT). (2) 
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Now, we apply definitions above to see that 
t = N a (f) 



= J2P(r,s)M r 

r=l 

m 

<mrP(r-l,s- l)M r 

r=l 

m 

< m ■ P{m - 2, s - 2) J^(r - l)M r , 



r=l 



from which we deduce 

m 

V(r - l)M r > ; -. (3) 

^ V ' m ■P(m-2,s -2) v ; 

r— 1 x ' 

Combining (J5J) and ([3]), we get 

m 

£(r-l)M r >^-0(*iTi). (4) 

Now, we estimate V(f) from above using this sum as follows. 

V(f)=n-n + V(f) 



n — 

r=l 



^2 rM r ~ ^2 M r 

r—1 r—1 

m 



Applying Q yields 
as claimed. 



V(f)<n-t» +0(t^), (5) 



4. The case where N 2 (f) is known 



We pay particular attention to the case s = 2 because it is more likely that one 
has information on pairs of elements with the same image than, say, 3-tuples or 
4-tuples. Also, the upper bound can be made explicit in this case. 

Theorem 4.1. Let f : A ->• B with \A\ = n and set N 2 (f) = t. Then M x (/) > 
Max(0, n — t) and 

t 2t 
n- - < Mi(f) + M 2 (f) < V(f) <n ■ 

The lower bound is precisely that from before, while the upper bound follows 
from a careful re-working of the proof of the upper bound. We omit the details. 

It is easy to see that, provided N 2 (f) < 2n, this lower bound is tight, as one can 
easily construct functions that meet this bound. Set N 2 (f) = t. Randomly choose 
t distinct elements xi, x 2 , . . . , xt € A and t/2 distinct elements yi, y 2 , . . . , y t / 2 G B. 
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For 1 < i < t/2, assign f(x2i-i) = f(x2i) = Hi- At this point, Ni^f) = t, so that / 
must be 1-1 on A \ {xi, . . . , x t }. It follows that V(f) = |+ n — t = n — |, which 
is the lower bound. 

It is clear from symmetry that A 2 (/) = t is necessarily even. Set t — 2k. Then 
the bounds read 

Ak 

n-k<V(f)<n 

It is natural to ask when is y/Sk + 1 <£ Z? Interestingly, 8fc + 1 is a square precisely 
when A: is a triangular number. In such cases, we have k = u(u — l)/2 for some 
integer u, 8k + 1 = S 2 where S = 2u — 1, and the upper bound simplifies neatly to 

V(f)<n- S —!-=n+l-u. 

In all cases where k is a triangular number, there exist functions which attain this 
bound. To construct such a function, choose u elements X\,X2, ■ ■ ■ , x u <E A and set 
f(xi) — f{x 2 ) = ■ ■ ■ = f(x u ). Now set / to behave 1-1 on the remaining elements 
of A. It can be seen that N%(f) = 2k and that the upper bound is attained. 

In all cases where k is not a triangular number, our upper bound is not exact. 
To make our upper bound tight, one needs to solve the following problem: 
Let T r = for any r e N, and fix k e N. By a triangular sum of 
length I for k we mean any instance of the equation 

l 

k = ^ ' T Ti , 

i=l 

where r\ > r 2 > ■ ■ ■ > ri . The weight of a given triangular sum 
is given by — I + (^ i=1 ri). Given k, we define to be smallest 
weight among all triangular sums for k. Find a formula for B^. 

Clearly, when k = T u , Bk = u — 1, but we do not know of a general formula for 
Bk- While Gauss famously proved that there exists a triangular sum for any k 
with length at most 3, it may not necessarily be the case that one such instance 
will provide the value for Bk- The connection to our bound should be clear: If 
7V 2 (/) = 2k, then V(f) < n~ Bk, with equality always possible. 

5. Applications 

We now consider a variety of implications and possible applications of our bounds 
in various settings. 

5.1. Finite fields and A^(/). We first introduce some notation. Let p be a prime 
and set q = p e . We use the standard notation of ¥ q for the finite field of q elements, 
F* for the non-zero elements of ¥ q , and F 9 [X] for the ring of polynomials over ¥ q in 
X. Every function over ¥ q can be represented uniquely, via Lagrange interpolation, 
by a polynomial of degree less than q, and the reduced form of a polynomial / e 
F g LY] is that polynomial g(X) given by g(X) = f(X) mod (X q —X). A polynomial 
/ € F 9 [X] is a permutation polynomial over ¥ q if V(f) = q. 

Research concerning the value of V(f) for polynomials over finite fields is exten- 
sive; we restrict ourselves to discussing a few outstanding general results. It is clear 
that, for lower bounds, there are obvious limits to the results you can expect to 
obtain - obviously V(f) > 1 with equality possible, while for polynomials of given 
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degree d, V(f) > 1 + is clear. That said, we have the following deep result by 
Cohen [5] concerning the average lower bound of V(f). 

Theorem 5.1 (Cohen [2]). Let f E F q [X] be of the form 

d-l 

f(x) = x d + Y, a * xl - 

1=1 

Let t be any integer such that < t < d — 2 and let dd-i, ■ • ■ , ad-t be fixed. Define 
v(d, t) = J2 V(f)/ < l d ~ t ~ 1 j where the sum is over all a\, . . . , a<j— t-i< Set m = \_(d — 
t)/2\. Then v(d,t) > c(q,m)q, where 

c(q, m) = 1 - 

Setting t = d — 2 in Cohen's result, we find that, in particular, on average, 
V(f) > 5f_T > f • 

A specific lower bound was obtained by Wan, Shiue, and Chen [141 under an 
additional condition on the polynomial. For / e F g [Jf], define u p (f) to be the 
smallest positive integer k such that X^eF f( x ) k 0- If n0 such k exists, define 
u p(f) = °°- 

Theorem 5.2 (Wan, Shiue, Chen [14]). If u p (f) < oo, then V(f) > u p (f) + 1. 

The authors note that u p (f) > [ Dc g re ^) J , so that under the conditions, their 
bound is at least as good as the obvious bound noted above. 

In terms of an upper bound, there is the following general bound by Wan 13J, 
given in terms of the degree of the polynomial. 




Theorem 5.3 (Wan [13]). Let f 6 F q [X]. If f is not a permutation polynomial 
over F q , then 

A better bound was obtained in P3] using p-adic techniques. To avoid unneces- 
sary technical details, we simply refer the interested reader to [13] , Theorem 3.1. 

Integral to applying either of our bounds is having knowledge of N s (f) for some 
s. For simplicity, we only discuss the case s = 2 here. We do not feel this is 
particularly limiting as, of the values of N s (f), knowledge of N2(f) seems most 
likely. We approach this issue in two ways: first, we establish the expected value 
of A^(/) for any polynomial / <G F g [X]. We then consider classes of polynomials 
which meet this expected value. 

For our next result, we need some further notation. Denote the standard trace 
mapping from F q to F p by Tr. Let w be a primitive pth root of unity. Recall that 
the canonical additive character, of F g is defined by xi( x ) — w Tr ( x ) for any 
x € Fq, and that all additive characters of F q are given by Xh{x) — xi(hx) for any 
he Fq. The following result is a straight generalisation of a result of Carlitz p]. 

Lemma 5.4. Given a random polynomial f £ F q [X], the expected value of N 2 (f) 
is q — 1 . Equivalently. for any f £ F q [X] , 

^N 2 (f(X) + aX) = q(q-l). (6) 
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Proof. Fix a polynomial, / G F g [A]. By the definitions above, 

q(N 2 (f) + q) = q(\{(x, y) : f{x) = f(y),x, y G ¥ q , x ^ y}\ 
+ \{(x:f(x) = f(x),xe¥ q }\) 

= E E Xh{f{x)-f{y)). 

h£¥ q x,yGF q 

To generate our average value for N 2 (f), we consider the average over the set 
{/(A) + aX : a G FJ. We have 

q(N 2 (f(x)+ax))+q)=J2J2 E Xh(f(x)-f(y)+a(x-y)) 



ae¥ q ae¥ q he¥ g x,y£¥ q 



9 3 +E E Xh(f(x)-f(y))J2xh(a(x-y)) 

he¥' q x,y£F q a£F q 



h=£Qx£F q 
,3 , 2/ 



= q d + q (9-1), 

where, in the second to last line, we have exploited the fact X^aeF x( a ( 2; — u)) = 
unless x = y. Comparing the left and right hand sides yields 



aeF, 



N 2 (f(X)+aX) = q(q-l). (7) 



The claimed expected value of N 2 (f) now follows at once, for we can, of course, 
partition the set of polynomials into equivalence classes, with two polynomials being 
equivalent if they differ only by a linear term aX: the average value of N 2 (f) for 
the polynomials in any equivalence class is q — 1 by ([7]). □ 

Now suppose / € F 9 [X] is a polynomial for which N 2 (f) = q — 1, the expected 
value. Applying our lower bound, we find V(f) > 2±i, which is more or less the 
same as that obtained by Cohen's result. In the other direction, applying our upper 
bound to /, we find 

1 + V4<7 - 3 

We cannot, of course, compare this directly to the result of Wan's, for we do not 
know if N 2 (f) = q — 1 has any direct implication on Degree(/). 

Given Lemma HT4l one obvious question arises: Is it possible to describe classes of 
polynomials for which the expected value for N 2 (f) is obtained? Are there natural 
conditions on / which force N 2 (f) = q — 1? We now discuss, for q odd, several such 
conditions (the case q even is clearly impossible for N 2 (f) is necessarily even). 

For any a G F*, we define the difference polynomial, Af, a (X) — A a (A), to be 
the polynomial given by A a (A) = f(X + a) — f{X). A polynomial / G ¥ q [X] 
is planar over ¥ q if, for every a G F*, the polynomial A a (X) is a permutation 
polynomial over ¥ q . An equivalent definition for planarity is that \Sh{f(X)+aX)\ = 
\J2xew q Xh(f{x) +ax)\ = y/q for all a, h G ¥ q , h f 0. 

Consider the following conditions on a polynomial / G F g [X]: 
C\ . / is planar over ¥ q . 

c 2 . iorhew;, \S h (f)\ = y/q. 

C3. for all a G F*, the polynomial Af ya (X) has a unique root. 
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C 4 . N 2 (f)=q-l. 

Clearly, C\ — > Ci and Ci — > C3 — > C4. It is shown in the proof of [5], Theorem 1, 
that C2 — > Ci, while a counting argument, also given in [5], shows C\ ^ C2. 

The relationship between C2 and C3 is less clear. Computations show that 
they are almost certainly inequivalent for sufficiently large q. Over F3, they are 
equivalent; over F5, they are not, though (C2 A C3) — ¥ C\. For q g {7,9}, they are 
inequivalent, and 

• there exist polynomials which satisfy both C'2 and C3 but not C\\ for ex- 
ample, f{X) = X 4 + 2X 2 e ¥ 7 [X]; and 

• there exist polynomials which satisfy one or other but not both conditions; 
for example, with g a primitive element of Fg, X 7 + gX 2 satisfies C2 but 
not C3, while X 8 + gX 2 satisfies C3 but not C2. 

This also shows C2 ^ C4 and C3 ^ C4. We suspect that the following statement is 
true, though we have no direct idea of how to establish it. 

Conjecture 5.5. For any finite field of any characteristic, the number of polyno- 
mials satisfying C3 is greater than or equal to the number of polynomials satsifying 
(■>■ 

Planar polynomials are closely related to affine planes [H |6] , semifields [3] , and 
difference sets [7J [TO]. Consequently, they have received a significant amount of 
attention. The bound given by ([8]) is, however, the first non-trivial upper bound on 
the size of the image set of a planar function. To that end, we have the following 
conjecture, attributed to Dembowski and Ostrom. 

Conjecture 5.6. If f E ¥ q [X] is planar over ¥ q , a field of characteristic p > 5, 
then the reduced form of f has the shape 

J2^jXP'+P 3 +Y / bkXP" +c. 

i,j k 

It can be seen that, if the conjecture is true, then Wan's result, Theorem 15.31 
gives the upper bound 

for any planar / £ ¥ q [X]. This is substantially worse than that implied by our 
Theorem 14.11 as given in ((8|). That our result so clearly outperforms Wan's is not 
surprising, for Wan's result applies to all polynomials of a given degree, while our 
result is being applied specifically to those / satisfying A^(/) = q — 1, a condition 
implied by planarity. That said, we also believe that (JSJ can be improved signifi- 
cantly for planar functions for we, too, are not utilising the full set of restrictions 
implied by the planar property. 

5.2. Arithmetic combinatorics. Here, we present a setting where 7V2 arises 
rather naturally. Let G be a (not necessarily abelian) group. For subsets A,BcG, 
define the product set of A and B to be 

A - B = {ab : a e A,b e B}. 

Much interest revolves around the relative sizes of A, B, and A - B. Some examples 
are the Cauchy-Davenport Theorem, Plunnecke-Rusza inequalities, and Freiman's 
Theorem; see the books by Nathanson [9] or Tao and Vu [12] . One useful tool for 
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these questions is the concept of energy. Various types of energy bounds have been 
the key ingredient in many recent results, such as the current best known sums and 
products bound due to Solymosi 

Given G, A, and B as above, we define the energy, E(A, B) to be 

E(A, B) = \{(a, a' ,b,b') £ A x A x B x B : ab = a'b'}\. 

If we consider / : A x B — >• G, f : (a, b) M- ab, we get a very close relationship 
between N 2 (f) and E(A,B), namely 

N 2 (f)=E(A,B)-\A\-\B\, 

where we removed the "diagonal" elements of the form (a, a, 6, 6) from the energy 
count. With this in mind, the following is a direct application of Theorem 14. 1[ 



Corollary 5.7. Let G be a group, A, B C G and set n — \A\ ■ \B\. Then we have 

Zn ~ E ^ B K\A.B\<n =W4,*)-n) (9) 

2 -' '- l + y/A(E(A,B)-n) + l 

Notice that these bounds are most effective when energy is small. 

5.3. Coding theory. Our final application is in coding theory. Much is known 
about the interplay between the redundancy of a given code and the amount of 
information that can be communicated per unit time; see Hall's notes on coding 
[8], for a good introduction. Here, we investigate messages transmitted through a 
noisy medium. 

Consider a function / : C —> M, where C is the codespace and M is the message 
space. In order to increase the likelihood that a message is decoded properly, even 
with errors in transmission, we will often give a single message word more than one 
code word. That is, it will often be the case that /(c) = f(c') for distinct c, d € C. 
By definition, V(f) will be precisely the number of distinct words in M, and N 2 (f) 
will be the number of times that two code words represent the same message. 

There are situations in which one has a particularly uneven message space, where 
a small number of messages have high priority, and need the best chances of being 
decoded correctly, while all remaining messages are less important, and their incor- 
rect decodings would have very little consequence. For example, a message space 
between fire towers in a forest could have a small number of special words about the 
existence or severity of a fire, and the other words could describe other, less impor- 
tant details, like the weather, in the case that there is no fire. Similar applications 
exist in a variety of different contexts such as operations in hostile environments. 
In such situations, an application of Theorem 14.11 yields the following. 



Corollary 5.8. In a code with a codespace C, a message space M., an assignment 
function f : C — > M., and t = \{f(c) — f(c') : c, c' *E C , c =/= c'}|, we have 

t ,„, 2t 
n--<\C\<n~ 



2 ~ 1 + V4i+1 

In this sense, our bounds provide a guide for balancing between levels of redun- 
dancy and flexibility within the code. 
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